Adplot: detection and visualization of repetitive patterns in complete genomes

Author

  • Akito Taneda
Abstract

MOTIVATION: Repetitive DNA sequences are abundant in genomes, and efficient mining of significant repeats is an important first step in repetitive sequence research. Although many computational tools for this purpose have been developed, both automatic and visualization-based, detection and analysis of approximate repeats remain non-trivial tasks.

RESULTS: Auto Dot PLOT (Adplot), a dotplot-like program for visualizing repetitive patterns that applies a window filter based on iid Bernoulli trials, was developed and applied to yeast chromosomes and a human T cell receptor locus sequence. Typical examples found in yeast chromosomes 1 and 10, and a tandem repeat with periods longer than 10,000 bp in the human T cell receptor locus, are presented. A complex structure composed of both direct and palindromic repeats found in yeast chromosome 10 is also visualized as a specific dot pattern. Computational time, measured on a Pentium 3 PC for each yeast auto chromosome with a standard parameter setting, scales linearly and is below 10 s per chromosome, indicating the efficiency of the program. The examples show that Adplot can visualize approximate local repeat structures and provides diagnostic power for inferring the duplication history of repeats.

AVAILABILITY: Adplot can be obtained by e-mail request.
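The abstract names the technique (a self dot plot with a window filter based on iid Bernoulli trials) but gives no details. The following is therefore only a minimal sketch of one way such a filter could work, not Adplot's actual implementation. It assumes that each base pair matches independently with probability `p_match`, and that a window is kept when its match count is unlikely under that model. The function names and the parameters `window`, `p_match`, and `alpha` are hypothetical.

```python
from math import comb


def binomial_tail(n, k, p):
    """Return P(X >= k) for X ~ Binomial(n, p)."""
    return sum(comb(n, j) * p**j * (1 - p) ** (n - j) for j in range(k, n + 1))


def min_significant_matches(window, p_match, alpha):
    """Smallest match count whose binomial tail probability is below alpha."""
    for k in range(window + 1):
        if binomial_tail(window, k, p_match) < alpha:
            return k
    return window + 1  # no match count is significant at this alpha


def filtered_self_dotplot(seq, window=20, p_match=0.25, alpha=1e-6):
    """Return (i, j) window start pairs, i < j, that pass the filter.

    Assumption: matches are iid Bernoulli(p_match) trials, so the number
    of matches in a window follows a binomial distribution.
    """
    k_min = min_significant_matches(window, p_match, alpha)
    n = len(seq)
    dots = []
    # Each offset corresponds to one diagonal above the main diagonal.
    for offset in range(1, n - window + 1):
        limit = n - offset  # number of cells on this diagonal
        match = [seq[i] == seq[i + offset] for i in range(limit)]
        count = sum(match[:window])
        if count >= k_min:
            dots.append((0, offset))
        # Slide the window along the diagonal, updating the count in O(1).
        for i in range(1, limit - window + 1):
            count += match[i + window - 1] - match[i - 1]
            if count >= k_min:
                dots.append((i, i + offset))
    return dots
```

This sketch compares every diagonal and so runs in quadratic time; it illustrates only the filtering idea, not the efficiency reported in the abstract. Palindromic (reverse-complement) repeats would need a second pass against the reverse complement, which is omitted here.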

Related resources

Dot Plot Detects Repetitive Structures in DNA Sequences

To date, many algorithms and programs for repeat sequence detection (such as Tandem Repeats Finder [2], REPuter [4], color coding [5], and so on) have been developed and applied to genome sequence analysis. In the present study, a visualization algorithm for detecting and analyzing repetitive sequences in genomes is proposed. The present method is based on a dot plot between identical se...

Computation and Visualization of Degenerate Repeats in Complete Genomes

The repetitive structure of genomic DNA holds many secrets to be discovered. A systematic study of repetitive DNA on a genomic or inter-genomic scale requires extensive algorithmic support. The REPuter family of programs described herein was designed to serve as a fundamental tool in such studies. Efficient and complete detection of various types of repeats is provided together with an evaluati...

Molecular typing of avian Escherichia coli isolates by enterobacterial repetitive intergenic consensus sequences-polymerase chain reaction (ERIC-PCR)

BACKGROUND: Colibacillosis is one of the most economically important diseases of poultry worldwide. OBJECTIVES: This study was conducted to examine the clonal relatedness and typing of 95 avian Escherichia coli isolates by ERIC-PCR. METHODS: Sixty-three E. coli isolates from two common manifestations of colibacillosis (yolk sac infection and colisepticemia) and 32 isolates from feces of apparen...

DNA Fingerprinting Based on Repetitive Sequences of Iranian Indigenous Lactobacilli Species by (GTG)5- REP-PCR

Background and Objective: The use of lactobacilli as probiotics requires the application of accurate and reliable methods for the detection and identification of bacteria at the strain level. Repetitive sequence-based polymerase chain reaction (rep-PCR), a DNA fingerprinting technique, has been successfully used as a powerful molecular typing method to determine taxonomic and phylogenetic relat...

Computational methods for Multiple Genome Alignment and Synteny detection

Multiple genome alignments are useful to detect synteny, gene order, and large-scale genomic re-arrangements which help to understand genome evolution, divergence and the development of protein functions. However, aligning multiple whole genomes is very computationally intensive [3] and many genomes are only partially complete. Fast approximation algorithms have been developed to handle both th...

Journal:
  • Bioinformatics

Volume 20, Issue 5

Pages: -

Publication date: 2004